Diversity-aware Evaluation for Paraphrase Patterns
Authors
Abstract
Common evaluation metrics for paraphrase patterns do not necessarily correlate with performance on the extrinsic paraphrase recognition task. We propose a metric that gives weight to lexical variety in paraphrase patterns. The proposed metric correlates positively with paraphrase recognition performance, with a Pearson correlation of 0.5 to 0.7 (k=10, with “strict” judgment) that is statistically significant (p < 0.01).
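As a rough illustration of the kind of check the abstract reports, the sketch below computes a Pearson correlation between an intrinsic metric score and an extrinsic task score over several pattern sets. The function name pearson_r and all numbers are placeholders chosen for illustration; they are not the paper's metric, data, or results, and the significance test (p-value) is not included.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Covariance and variances, all left unnormalized (the factors cancel).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical placeholder values, NOT taken from the paper:
# one intrinsic metric score and one recognition-task score per pattern set.
metric_scores = [0.12, 0.35, 0.41, 0.58, 0.77]
recognition_scores = [0.50, 0.55, 0.53, 0.61, 0.66]

r = pearson_r(metric_scores, recognition_scores)
print(f"Pearson r = {r:.3f}")
```

A value of r near 1 would mean the intrinsic metric ranks pattern sets much as the recognition task does; the abstract reports values in the 0.5 to 0.7 range for its own data.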
Similar resources
Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora
Paraphrase patterns are useful in paraphrase recognition and generation. In this paper, we present a pivot approach for extracting paraphrase patterns from bilingual parallel corpora, whereby the English paraphrase patterns are extracted using the sentences in a foreign language as pivots. We propose a log-linear model to compute the paraphrase likelihood of two patterns and exploit feature func...
Expanding Paraphrase Lexicons by Exploiting Lexical Variants
This study tackles the problem of paraphrase acquisition: achieving high coverage as well as accuracy. Our method first induces paraphrase patterns from given seed paraphrases, exploiting the generality of paraphrases exhibited by pairs of lexical variants, e.g., “amendment” and “amending,” in a fully empirical way. It then searches monolingual corpora for new paraphrases that match the pattern...
Joint Learning of a Dual SMT System for Paraphrase Generation
SMT has been used in paraphrase generation by translating a source sentence into another (pivot) language and then back into the source. The resulting sentences can be used as candidate paraphrases of the source sentence. Existing work that uses two independently trained SMT systems cannot directly optimize the paraphrase results. Paraphrase criteria especially the paraphrase rate is not able t...
Gene Probe Designing for Evaluation of the Diversity of Bradyrhizobium japonicum Isolates
Many researchers consider the use of different probes for hybridization assays as suitable for studying the genetic diversity of nitrogen-fixing bacteria. In this study, to assess genetic diversity among Bradyrhizobium japonicum isolates, two different probes (sucA and topA) chosen from the chromosomal genome of Bradyrhizobium strain USDA 110 were designed, evaluated by DNAMAN software and im...
Extracting paraphrase patterns from bilingual parallel corpora
Paraphrase patterns are semantically equivalent patterns, which are useful in both paraphrase recognition and generation. This paper presents a pivot approach for extracting paraphrase patterns from bilingual parallel corpora, whereby the paraphrase patterns in English are extracted using the patterns in another language as pivots. We make use of log-linear models for computing the paraphrase l...